翻訳と辞書
Words near each other
・ Wrapped distribution
・ Wrapped exponential distribution
・ Wrapped in a Dream
・ Wrapped in Red
・ Wrapped in Red (song)
・ Wrapped in Ribbon
・ Wrapped Lévy distribution
・ Wrapped normal distribution
・ Wrapped Tight
・ Wrapped Up
・ Wrapped Up Good
・ Wrapped Up in Pinstripes
・ Wrapped Up in You
・ Wrapper
・ Wrapper (clothing)
Wrapper (data mining)
・ Wrapper (philately)
・ Wrapper function
・ Wrapper library
・ Wrapping
・ Wrapping (graphics)
・ Wrapping (overflow)
・ Wrapping Paper
・ Wrapping tissue
・ Wrapports
・ WRAR
・ WRAR (AM)
・ WRAR-FM
・ WRAS
・ WRAS (FM)


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Wrapper (data mining) : ウィキペディア英語版
Wrapper (data mining)
Wrapper in data mining is a program that extracts content of a particular information source and translates it into a relational form.〔Nicholas Kushmerick, Daniel S. Weld, Robert Doorenbos, (''Wrapper Induction for Information Extraction'' ) Proceedings of the International Joint Conference on Artificial Intelligence, 1997〕 Many web pages present structured data - telephone directories, product catalogs, etc. formatted for human browsing using HTML language. Structured data are typically descriptions of objects retrieved from underlying databases and displayed in Web pages following some fixed templates. Software systems using such resources must translate HTML content into a relational form. Wrappers are commonly used as such translators. Formally, a wrapper is a function from a page to the set of tuples it contains.
==Wrapper generation==
There are two main approaches to wrapper generation: wrapper induction and automated data extraction.
Wrapper induction uses supervised learning to learn data extraction rules from manually labeled training examples. The disadvantages of wrapper induction are
* the time-consuming manual labeling process and
* the difficulty of wrapper maintenance.
Due to the manual labeling effort, it is hard to extract data from a large number of sites as each site has its own templates and requires separate manual labeling for wrapper learning.
Wrapper maintenance is also a major issue because whenever a site changes the wrappers built for the site
become obsolete. Due to these shortcomings, researchers have studied automated wrapper generation using
unsupervised pattern mining. Automated extraction is possible because most Web data objects follow fixed
templates. Discovering such templates or patterns enables the system to perform extraction automatically.〔Liu, B. Web ''Data Mining: Exploring Hyperlinks, Contents and Usage Data'', Springer, 2007.〕
Wrapper generation on the Web is an important problem with a wide range of applications. Extraction of such data enables one to integrate data/information from multiple Web sites to provide value-added services, e.g., comparative shopping, object search, and information integration.
–the wrapper content can be enhanced

抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Wrapper (data mining)」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.